Educative: Interactive Courses for Software Developers

Most programming languages support a data type for real numbers, called float or double. SQL supports a similar data type of the same name. Many programmers naturally use the SQL FLOAT data type everywhere they need fractional numeric data because they are accustomed to programming with the float data type.

The FLOAT data type in SQL, like float in most programming languages, encodes a real number in a binary format according to the IEEE 754 standard. We need to understand some characteristics of floating-point numbers in this format to use them effectively.

Rounding by necessity#

Many programmers are not aware of a characteristic of this floating-point format: not all values that can be described in decimals can be stored in binary. Out of necessity, some numbers must be rounded to a very close value.

To give some context for this rounding behavior, let’s look at rational numbers that have repeating decimal numbers, such as one-third, written as “0.333…”. The true value cannot be represented in decimal because we must write an infinite number of digits. The number of digits is the precision of the number, so that a repeating decimal number would require infinite precision.

The compromise is to use finite precision, which is choosing a numeric value as close as possible to the original value, for example, “0.333”. However, this means that the value isn’t the same number that we intended.

1/3 +1/3 +1/3 = 1.000

0.333 + 0.333 + 0.333 = 0.999

Even if we increase the precision, we still can’t add three of these approximations of one-third to get a true value of 1.0. This is the necessary compromise of using finite precision to represent numbers that may have repeating decimals.

1/3 +1/3 +1/3 = 1.000000

0.333333 + 0.333333 + 0.333333 = 0.999999

This means that many legitimate numbers that we can imagine cannot be represented with finite precision. We may think it to be acceptable because we can’t really type a number with infinite digits anyway, so, naturally, we should make do with typing any number with finite precision and storing it precisely — right? Unfortunately not.

IEEE 754 represents floating-point numbers in a base-2 format. The values that require infinite precision in binary are different values from those that behave this way in decimal. Some values that only need finite precision in decimal, for instance, 59.95, require infinite precision to be represented exactly in binary. The FLOAT data type can’t do this, so it uses the closest value in base-2 it can store, which is equal to 59.950000762939 in base-10.

Some values coincidentally use finite precision in both formats. In theory, if we understand the details of storing numbers in the IEEE 754 format, we can predict how a given decimal value is represented in binary. But in practice, most people won’t do this computation for every floating-point value they use. We can’t guarantee that a FLOAT column in the database will be given only values that are cooperative, so our application should assume that any value in this column may have been rounded.

Meet the IEEE 754 format

Some databases support related data types called DOUBLE PRECISION and REAL. The precision that these data types and FLOAT support varies by database implementation, but they all represent floating-point values with a finite number of binary digits, so they all have similar rounding behavior.